Circuit, system and method for controlling read latency

ABSTRACT

A read latency control circuit is described having a clock synchronization circuit and a read latency control circuit. The clock synchronization circuit includes an adjustable delay line to generate an output clock signal whose phase is synchronized with the phase of the input clock signal. The read latency control circuit captures a read command signal relative to the timing of the input clock signal and outputs the read command signal relative to the timing of the output clock signal such that the read command signal is outputted indicative of a specified read latency.

CROSS-REFERENCE TO RELATED APPLICATION

This application is a continuation of U.S. patent application Ser. No. 12/683,309, filed Jan. 6, 2010, U.S. Pat. No. 8,018,791, which is a continuation of U.S. patent application Ser. No. 11/724,910, filed Mar. 15, 2007, U.S. Pat. No. 7,656,745. These applications and patents are incorporated by reference herein in their entirety and for all purposes.

TECHNICAL FIELD

Embodiments of the invention relate to integrated circuits, and more specifically to controlling read latency of a memory device using a DLL loop.

BACKGROUND

Synchronous integrated circuits are clocked by an external clock signal and perform operations at predetermined times relative to the rising and falling edges of the external clock signal. For example, in dynamic random access memories (“DRAMs”), the timing of external signals, such as command, address and write data signals, is determined by the external clock signal, and the memory device must latch these signals at the proper times to successfully capture the signals. To latch these applied signals, an internal clock signal is developed in response to the external clock signal, and is conventionally applied to latches contained in the memory device to clock the external signals into the latches. The internal clock signal and external clock signal must be synchronized to ensure the latches are clocked at the proper times to successfully capture the external signals.

When a read request is made, a memory controller requesting the data expects data from the memory device to be available on a data bus within a predetermined read latency, which is usually a predetermined number of system clock cycles after the read request is made, e.g. eight external clock cycles. The memory device has its own internal clock system, which receives the external clock signal and develops from the external clock signal several different internal clock signals using a delay locked loop (“DLL”). Problems with maintaining read data latency arise in high speed DRAMs from the necessity to align data with the external clock using the internal clock signals generated by the DLL.

In order to meet a specified read latency the memory device must be able to count clock signals upon receiving a READ command and activate an output latch to provide the requested data relative to the read clock signal. The read clock is provided to the read latch with a desired phase relationship to the external system clock signal. During high speed operations, the amount of read clock delay relative to the read data valid time becomes essentially indeterminable, consequently making it very difficult to control the read clock signal to ensure a correct data output and a specified read latency measured in terms of external clock cycles.

One solution to these problems is disclosed in U.S. Pat. No. 6,687,185, which discloses synchronizing the variable timing of the internal clock signals derived from an external clock signal such that the read data and the read clock signal used to latch the read data arrive at the data latch in synchronism and with a specified read latency. With reference to FIG. 1, an external memory controller 102 supplies command and address signals C/A to a memory device 100 through a command/address buffer 112, which applies received buffered signals to either a command decoder 128 for decoding incoming commands from the memory controller 102 or an address decoder 132 for decoding incoming addresses from the memory controller 102. When a READ command is received, the command decoder 128 decodes the READ command and applies decoded signals to a read logic 136 control circuit that initiates a read operation on the memory array 140. The read logic 136 control circuit operates the memory array 140 to read out data from one or more memory addresses that are specified by an address received from the memory controller 102 and decoded by the address decoder 132.

The memory controller 102 additionally supplies an external clock signal EXTCLK to an input buffer 110 of the memory device 100. The input buffer 110 generates a buffered EXTCLK signal that is applied to a delay lock loop (“DLL”) 120, which generates internal clock signals for the memory device 100, including the read clock signal RDCLK. As previously discussed, the RDCLK signal is used to drive a read latch 148 so that the requested read data from the memory array 140 is properly latched to the respective DQ pad in a timeframe that meets the specified read latency. The input buffer 110 also applies the buffered EXTCLK signal to drive the command decoder 128 and the read logic 136 control circuit to decode the command and address signals in a timely manner.

The command decoder 128 additionally generates a RDSTART signal that has a fixed timing relationship to the RDCLK signal. The RDSTART signal is generated to track any changes in the RDCLK timing, and is also used to synchronize the read data from the memory array 140. When a READ command is decoded by the command decoder 128 the RDSTART signal is applied to a slave delay circuit 124, which models the DLL 120 and causes its output signal to be slaved to the RDCLK signal from the DLL 120. Therefore, any timing variations imparted on the RDCLK signal by the DLL 120 are also applied to the RDSTART signal. The slave delay circuit 124 outputs a delayed RDSTART signal that is applied to a counter 144, which also receives a specified read latency value from a mode register 130. The specified latency value may be predetermined at the time of manufacture, or may be a value from the memory controller 102. The delayed RDSTART signal enables the counter 144 to count down from an initial value using the read clock signal to the predetermined count value that represents the specified read latency for the memory device 100. Upon completing the count, the counter 144 signals the read latch 148 to synchronize latching the read data relative to the RDCLK based on the specified read latency.

Since the amount of delay that must be imposed on the RDCLK signal relative to when the read data is made available is indeterminable during high speed operations, it is very difficult to ensure the data output is correct and the specified read latency is met even with model delay circuits such as the slave delay circuit 124. In order for the slave delay circuit 124 to impart to the RDSTART signal the same delay that the DLL 120 imparts to the EXTCLK signal, the entire DLL 120 must be precisely modeled and the delays must be accurately matched. As is known to those skilled in the art, DLL circuits are generally large and cumbersome within the memory device space. Therefore, the trade-off of having a slave delay circuit 124 that accurately replicates the delay of the DLL 120 is that chip space consumption is doubled.

If other delay variations must be accounted for, additional model circuits and other compensative circuitry may also be required, such as offset circuits or additional counters. Such other circuits have been utilized to synchronize the read clock and the read data access within specified latency requirements. For example U.S. Pat. No. 7,065,001 entitled Method and Apparatus for Initialization of Read Latency Tracking circuit in High-Speed DRAM, which is assigned to the same assignee as the present invention, discloses a method of synchronizing counters in two different clock domains by generating a start signal for tracking a running count of clock pulses of a read clock signal. The start signal is generated by an initialization circuitry that includes an offset calculation. Various timing variations to the read clock signal are compensated in a circuit that includes a DLL, I/O model circuits and four separate counter circuits. As the demand for smaller high-speed memory chips continue to increase, any additional circuitry on the memory chip leads to significant loss of valuable chip space. However, if the read data is not provided at the DQ with a precise read latency, due to mismatched timing variations, the data will be corrupted and unusable.

Therefore, there is a need for a DLL circuit that accurately maintains a specified read latency while achieving a read operation synchronized to a read clock signal, but that also reduces the consumption of chip area.

BRIEF DESCRIPTION OF THE DRAWINGS

FIG. 1 is a block diagram of a prior art read latency tracking circuit having a DLL circuit and a slave circuit modeling the DLL circuit.

FIG. 2 is a block diagram of a DLL circuit with read latency control according to an embodiment of the invention.

FIG. 3 is a block diagram of a DLL circuit configured to provide various signals to a read latency control circuit according to another embodiment of the invention.

FIG. 4 is a timing diagram of various signals during operation of the DLL circuit according to an embodiment of the invention of FIG. 3.

FIG. 5 is a functional block diagram illustrating a memory device that includes a DLL circuit according to an embodiment of the invention.

FIG. 6 is a functional block diagram illustrating a computer system including the memory device of FIG. 5.

DETAILED DESCRIPTION

Embodiments of the present invention are directed to a read latency control circuit controlled by a DLL circuit. Certain details are set forth below to provide a sufficient understanding of the embodiments of the invention. However, it will be clear to one skilled in the art that the embodiments of the invention may be practiced without these particular details. In other instances, well-known circuits, control signals, and timing protocols have not been shown in detail in order to avoid unnecessarily obscuring the embodiments of the invention. Although the embodiments of the present description are directed to synchronous memory devices, the principles described herein are equally applicable to other types of synchronous integrated circuits.

FIG. 2 shows a block diagram of a read latency control system 200 that includes a synchronization circuit 201 configured to provide various signals to a read latency control circuit 202 for synchronizing a read operation with an external clock signal. The synchronization circuit 201 may be a delay locked loop, phase locked loop or another clock management circuit in a synchronous integrated circuit, which are well-known in the art. In one embodiment, the synchronization circuit 201 may be used to generate internal clock signals relative to a received external clock signal EXTCLK to which read data is latched and synchronized in a memory device. The synchronization circuit 201 generates an upstream clock signal CLKUP from the EXTCLK signal and a downstream clock signal CLKDN having a timing relationship relative to the CLKUP signal. The CLKUP signal and the CLKDN signal are applied to the read latency control circuit 202 and used to track internal read clock pulses relative to the EXTCLK signal during a memory read operation. More specifically, the CLKUP and CLKDN signals are used by the read latency control circuit 202 to count the number of clock cycles that are calculated to meet a specified read latency CL for latching the read data. A count signal N_count is provided to the read latency control circuit 202 by the synchronization circuit 201 after the external and internal clocks are synchronized through a series of delay elements. The N_count signal may be a count of the number of clock cycles calculated by the synchronization circuit 201 to achieve the locking condition. An embodiment for calculating the N_count and generating an N_count signal is described in detail in a commonly assigned patent application Ser. No. 11/612,798 (filing date Dec. 19, 2006) entitled TIMING SYNCHRONIZATION CIRCUIT WITH LOOP COUNTER, and is hereby incorporated by reference. The synchronization circuit 201 provides the CLKDN signal to an output driver 225 that generates a read clock signal DQS whose phase is matched to the phase of the EXTCLK signal. By relying on the synchronization circuit 201 for timing signals, the read latency control system 200 eliminates the need for additional circuitry, such as the slave delay circuit 124 of FIG. 1.

As previously explained, the specified read latency CL must be met when read data is accessed from the memory device during a read operation. The specified latency CL is conventionally in the form of a number of clock cycles and may be fixed in the memory device at the time of manufacture or may be a value from an external memory controller (not shown) as part of a command signal and programmed into a mode register (also not shown) in a separate location in the memory device. During a read operation, the read latency control circuit 202 receives a read command signal READ, and upon receiving the CLKUP and the CLKDN signals respectively, the read latency control circuit 202 matches the timing of the READ signal to the internal clock signals of the synchronization circuit 201. The READ signal is initially received by a latch circuit 204 when a countup counter 206 is enabled by the CLKUP signal. The latched READ signal is passed on to a control circuit 278 when a countdn counter 208 is enabled by the CLKDN signal. In response to receiving the combination of the READ signal from the latch circuit 204, the CL signal from the mode register (not shown), and the N_count signal from the synchronization circuit 201, the control circuit 278 synchronizes the READ signal the DQS signal. As a result, the latched read data (not shown) is provided to an output driver 235 and passed on to an output terminal DQ pad 329 (shown in FIG. 3) as data DQ relative to the timing of the DQS signal, and as required by the specified latency CL. Additional delay models that unnecessarily consume extra chip space are thereby reduced by utilizing the CLKUP and CLKDN signals that can be provided directly to the read latency control circuit 202 by the synchronization circuit 201. High speed operations are thus better achieved by utilizing a clock-based control system such as the read latency control system 200.

Embodiments of the synchronization circuit 201 and read latency control circuit 202 are shown in further detail in FIG. 3. With respect to the synchronization circuit, complimentary external clock signal CLK and CLK* signals are applied to an input buffer 310 of a delay locked loop 301 (“DLL”). The input buffer 310 applies a buffered external clock signal CLKS to one of two inputs of a NAND gate tID 312. A second input to the tID gate 312 will be described further in the context of describing an initialization mode circuit 317 that provides the second input. After evaluating its two inputs, the tID gate 312 outputs an upstream CLKUP signal that is applied to a delay line circuit 320. The delay line circuit 320 delays the received CLKUP signal based on a control signal from a phase detector 350, and an output buffer 314 drives the output of the delay line circuit 320 as a downstream signal CLKDN to output drivers 323, 325. The CLKDN signal is eventually passed on as the output signal DQS to the output bus through DQS pad 329 after locking is achieved. The output of the delay line circuit 320 is connected to a feedback loop that returns the signal to the phase detector 350 where a phase difference-related signal that controls one or more delay lines of the delay line circuit 320 is generated by the phase detector 350 and applied to the delay line circuit 320.

In the feedback loop, the CLKDN signal is fed through a series of model delay circuits 354 to generate a delayed feedback signal FBCLK at the output of the model tID gate 354 c. The FBCLK signal includes the delays applied to the CLKDN signal that is approximately equal to the propagation delays due to the input buffer 310, the tID gate 312, and the output drivers 323, 325 so that the delays applied to the external clock signal as it propagates through the DLL circuit 301 are compensated for as the locking condition is achieved. The phase detector 350 compares the FBCLK signal and the received CLKUP signal to generate a control signal that either advances or delays the timing of the received CLKUP signal such that its phase is matched to the phase of the FBCLK signal. The CLKDN signal may be returned through the feedback loop again to determine whether a locking condition has been met or to make further phase adjustments. The phase detector 350 additionally generates an N_count signal representing a count of the number of clock cycles used to adjust the received input signal in order to achieve the phase-locked condition.

The phase detector 350 also generates a clock hold signal DLL_initlock that enables the initialization mode circuit 317 to temporarily lock the delay line circuit 320 causing it to temporarily stop toggling for a predetermined number of clock periods. The DLL_initlock signal is initially applied to a delay line lock control circuit 358 that provides a high control signal to a nor gate 322. The second input to the nor gate 322 is a constant low signal powerdn. The high control signal causes the nor gate 322 to provide a low input signal to a flip-flop 318 whose output provides the second input to the tID gate 312 relative to the rising edge of the CLKS signal. When the second input of the tID gate 312 is low, due to the logic circuit 358 receiving the DLL_initlock signal, the delay line 320 temporarily stops for five clock cycles, as will be further described later. Otherwise, the second input of the tID gate 312 is defaulted to be high during normal operation, in which case the CLKS signal is permitted to pass through to the delay line circuit 320.

As discussed above, FIG. 3 also shows an embodiment of a read latency control circuit 302 in greater detail, which may be used as the read latency control circuit 202 in FIG. 2. The read latency control circuit 302 is enabled when a read command signal READ is received by one input of a command input buffer 313 during a read operation. A V_(REF) signal is applied to the second input of the command buffer 313 to which the READ signal is compared. If the READ signal is greater than V_(REF) a high logic signal is buffered out and applied to a flip-flop 362, which is clocked by the same CLKS signal as the flip-flop 318 of the DLL circuit 301. In response to the rising edge of the CLKS signal, the flip-flop 362 outputs a read command logic signal RDSQ. At that instance, the RDSQ signal is received by whichever one of a set of latches 375 is first enabled to receive the command signal.

Each latch 375 is correspondingly coupled to one of a set of flip-flops 365 of an upstream ring counter 304. The inputs and outputs of the flip-flops 365 are coupled together in series with the output of the last flip-flop 365 n being coupled to the input of the first flip-flop 365 a. Each of the outputs is additionally coupled to a corresponding latch 375. For example, the output of the flip-flop 365 a is coupled to the input of the latch 375 a, the output of the flip-flop 365 b is coupled to the input of the latch 375 b, and so on. If the read latency control circuit 302 includes N number of latches 375, the upstream ring counter 304 will also include N number of flip-flops 365. The output of each flip-flop 365 is initially reset to low before operation, except the last flip-flop 365 n, which is initially reset to high. The flip-flops 365 are clocked by the CLKUP signal from the DLL circuit 301, and on the first rising edge of the CLKUP signal, the high output of the last flip-flop 365 n is received by the input of the first flip-flop 365 a, while all other flip-flops 365 b-n receive the low signal from the preceding outputs. On the second rising edge, the output of the first flip-flop 365 a provides its high input to the second flip-flop 365 b while all other flip-flops receive low input signals. The first flip-flop 365 a additionally provides a high enable signal to the latch 375 a, which allows the latch 375 a to receive the RDSQ signal if available. On the third rising edge of the CLKUP signal, the second flip-flop 365 b provides a high input to the third flip-flop 365 c, which then enables the latch 375 c to receive the RDSQ signal if available. In this manner only one latch 375 is enabled at one time to receive the RDQS signal, and each latch 375 is enabled in sequential order with respect to the flip-flops 365 of the upstream ring counter 304 thereby each latch 375 takes a turn in receiving a high enable signal which allows one of the latches 375 to capture the RDSQ signal relative to the CLKUP signal.

Each latch 375 is also coupled to a respective switch 395 that controls when the latched RDSQ signal is propagated through to an OR gate 368. If there are N latches 375, then there are also N number of switches 395. Similar to the latches 375, each of the switches 395 are controlled by a respectively coupled flip-flop 385 in a downstream ring counter 305. The downstream ring counter 305 includes the same number of flip-flops 385 as there are switches 395. The downstream ring counter 305 functions in the same manner as the upstream counter 304, except that the downstream ring counter 305 is clocked by the CLKDN signal from the DLL circuit 301. Therefore, the first available latch 375 holding the RDSQ signal is pass through to the OR gate 368 by the first enabled switch 395 relative to the CLKDN signal, which is controlled by the downstream ring counter 305 clocked by the CLKDN signal.

When the OR gate 368 determines a match between the latch 375 storing the RDSQ signal and the enabled switch 395, a MATCH signal is generated and provided to a latency circuit 378, which also receives a predetermined specified latency signal from a mode register (not shown) as previously described. The latency circuit 378 additionally receives the N_count signal from the phase detector 350 of the DLL circuit 301. When receiving the combination of the CL, N_count and MATCH signals, the latency circuit 378 generates a LATENCY signal having a delay relative to the CLKDN signal. The LATENCY signal is latched by a flip-flop 363 that is also clocked by the CLKDN signal, and the flip-flop 363 generates an enable signal DQen, to enable pre-output and output drivers 323, 325, 333, 335 of both the DLL circuit 302 and the read latency control circuit 301. The pre-output driver 333 is further coupled to a read latch 366 that latches read data from a memory array (not shown) in response to a memory request. The DQen signal thus enables the pre-output driver 333 to drive the latched read data through an output driver 335 to an output terminal DQ pad 339 relative to the CLKDN signal. At the same time, the DQen signal enables the pre-output driver 323 to drive the CLKDN signal through the output driver 325 to apply the DQS signal to the DQS pad 329. For purposes of matching timing variations, the output drivers 323, 325 are configured in the same manner as the output drivers 333, 335.

In summary, the DLL circuit 301 and the read latency control circuit 302 simultaneously work together to synchronize the DQ data (being applied on the DQ pad 339) with the DQS signal (being applied on the DQS pad 329) as required by the read latency signal CL. As previously described, the operation of the read latency control circuit 302 is initiated upon receiving the READ command signal and latched to one of the latches 375 that is enabled by the upstream ring counter 304 clocked by the CLKUP signal. The CLKDN signal, which is used to generate the DQS signal by the DLL circuit 301, is also utilized by the downstream ring counter 305 to timely enable the switch 395 that corresponds to the latch 375 holding the RDSQ signal. When the OR gate 368 determines a match between the latch 375 and the switch 395, indicating a read request is in queue, the latency circuit 378 generates the LATENCY control signal that determines when to provide the read data from the memory array captured in the read latch 366. The LATENCY control signal is calculated based on values representing the specified read latency CL, the N_count from the phase detector 350, and the MATCH signal from the OR gate 368, all relative to the CLKDN signal. When the flip-flop 363 receives the LATENCY signal, the flip-flop 363 generates the DQen signal, which enables the pre-output driver 333 to pass on the read data to the DQ pad 339 from the read latch 366 relative to the CLKDN signal. The pre-output driver 323 simultaneously receives the DQen signal to drive the CLKDN signal from the delay line circuit 320 to drive the DQS signal to the DQS pad 329 as the DQ data is driven to the DQ pad 339.

The DLL_initlock signal is generated by the phase detector 350 and applied to the delay line lock circuit 358 of the initialization mode circuit 317 such that the flip-flops 318, 362 are synchronized by the same CLKS signal that starts the synchronization operation of the read latency control circuit 302. Since the CLKUP and CLKDN signals of the DLL circuit 302 are free running clocks, the initialization mode circuit 317 allows the read latency control circuit 302 to determine which of the clocks is first. In response to the DLL_initlock signal, the initialization mode circuit 317 locks the delay line circuit 320 for a predetermined number of clock cycles. For example, if the initialization mode circuit 317 is programmed for 5*tCK, the delay line circuit 320 waits five clock cycles and then starts toggling again. Therefore, the first received signal from the DLL circuit 302 is the CLKUP signal, since the CLKDN signal is delayed relative to the CLKUP signal. Additionally, prior to the DLL_initlock signal being received by the control circuit 358, the ring counters 304, 305 are reset by a RESET signal. Therefore, by receiving the DLL_initlock signal, the initialization mode circuit 317 also knows that the ring counters 304, 305 have been reset and are ready to receive the CLKUP and CLKDN signals, respectively.

The required specified read latency is matched by the design of the read latency control circuit 302 such that the sum of delays between the READ signal input of command input buffer 313 to the DQ pad 339 yields the specified read latency value. Since the READ signal is latched by the flip-flop 362 being clocked by the CLKS signal and the MATCH signal is generated relative to the CLKDN signal, the pathway between the command input buffer 313 and the OR gate 368 is designed to have the same delay variations as the EXTCLK-to-CLKDN pathway between the input buffer 310 to the output buffer 314 of the DLL circuit 301 for generating the CLKDN signal. The EXTCLK-to-CLKDN pathway is equal to tIB+tID+(N*tCK−tFB), where tIB is the propagation delay due to the input buffer 310 and the tID is the propagation delay due to the tID gate 312. The N represents the N_count delay calculation and tCK is the clock period. As previously described, the N_count is the number of clock cycles calculated by the phase detector 350 in the synchronization circuit 301 to achieve the locking condition. The entire closed loop delay, from the output of the tID gate 312 (the CLKUP node) to the output of the model tID gate 354 c (the FBCLK node) is always N*tCK after locking has been established. The tFB represents the propagation delays of the feedback loop (tFB=tID+tIB+tDQ). The remaining pathway of the read latency control circuit 302 (from the output of the OR gate 368 to the DQ pad 339) is designed to equal ((CL−N)*tCK−1*tCK)+1*tCK+tDQ, where (CL−N)*tCK−1*tCK is the delay calculated by the latency circuit 378. The 1*tCK is the propagation delay due to the flip-flop 363, and the tDQ is the propagation delay due to the output drivers 333, 335. The sum of the delays for two pathways determines the delay between the command input buffer 313 to the DQ pad 339, which equals CL*tCK, resulting in the calculation for CL, the predetermined read latency value.

A timing diagram summarizing the clock signals of the read latency control circuit 302 of FIG. 3 is shown in FIG. 4. At time T0, the CLKS signal is generated by the DLL circuit 301 in response to receiving the EXTCLK signal, from which the CLKUP and CLKDN signals are also generated. At time T1, the DLL_initlock provided to the initialization mode circuit 317 transitions from low to high to lock the delay line circuit 320 and cause the input clock CLKS to stop toggling for a predetermined time of five clock cycles, shown at time T2A. Also at time T2A, the CLKUP signal is subsequently locked for five clock cycles, followed by the CLKDN signal being locked for five clock cycles at time T2B. As previously described, since the CLKUP and CLKDN are free-running clocks, it is difficult for the read latency control circuit 302 to determine the leading clock signal. After a five clock cycle hold, the CLKUP signal first resumes toggling at time T3, and the upstream ring counter 304 begins to sequentially enable each latch 375 one at a time between times T3 to T4. After some delay, the downstream ring counter 304 begins to sequentially enable each switch 395 one at a time between times T5 to T6. The first read command signal is received and detected as a high signal by the command input buffer 313 at time T6, and the RDSQ signal is latched by the fourth latch 375 d, which is coincidentally the latch that is enabled by the upstream ring counter 304 at time T7A. A second READ command signal is received at time T7B, which is latched by the next enabled latch 375 b at time T8. Also at time T8, the switch 395 d coupled to the fourth latch 375 d is enabled by the downstream ring counter 305 and a first MATCH signal is generated corresponding to the first received READ command signal. At the next available rising edge of the CLKDN signal, the downstream ring counter 305 enables the switch 395 b corresponding to the latch 375 b holding the second received READ signal, and a second MATCH signal is generated at time T9. The first and second MATCH signals are received in order by the latency circuit 378, and first and second LATENCY signals are generated in that order at times T9 and T10, respectively. In response, read data corresponding to the first READ signal is applied to the DQ pad 339 at time T11 and the read data corresponding to the second READ signal is applied to the DQ pad 339 at time T12. For the example shown in FIG. 4, the read latency value CL is predetermined and provided by either an external controller or a mode register as previously described, and is set to equal 8. Therefore, the read latency count for the first READ signal is shown as eight clock cycle from the time the READ signal is received at time T6 to the time the first read data is applied on the DQ pad 339 at time T11. As a result, the specified read latency has been met. The read latency count for the second READ signal is similarly indicated between times T7B and T12.

FIG. 5 is a functional block diagram of an embodiment of a memory device 500 that includes the DLL circuit 301 and the read latency control circuit 302 of FIG. 3 or a DLL circuit and read latency control circuit according to another embodiment of the invention. The memory device 500 in FIG. 5 is a double-data rate (“DDR”) SDRAM, although the principles described herein are applicable to any memory device that may include a circuit for synchronizing internal and external signals, such as conventional SDRAMs, as well as packetized memory devices like SLDRAMs and RDRAMs, and are equally applicable to any integrated circuit that must synchronize internal and external clocking signals.

The memory device 500 includes an address register 502 that receives row, column, and bank addresses over an address bus ADDR, with a memory controller (not shown) typically supplying the addresses. The address register 502 receives a row address and a bank address that are applied to a row address multiplexer 504 and bank control logic circuit 506, respectively. The row address multiplexer 504 applies either the row address received from the address register 502 or a refresh row address from a refresh counter 508 to a plurality of row address latch and decoders 510A-D. The bank control logic 506 activates the row address latch and decoder 510A-D corresponding to either the bank address received from the address register 502 or a refresh bank address from the refresh counter 508, and the activated row address latch and decoder latches and decodes the received row address. In response to the decoded row address, the activated row address latch and decoder 510A-D applies various signals to a corresponding memory bank 512A-D to thereby activate a row of memory cells corresponding to the decoded row address. Each memory bank 512A-D includes a memory-cell array having a plurality of memory cells arranged in rows and columns, and the data stored in the memory cells in the activated row is stored in sense amplifiers in the corresponding memory bank. The row address multiplexer 504 applies the refresh row address from the refresh counter 508 to the decoders 510A-D and the bank control logic circuit 506 uses the refresh bank address from the refresh counter when the memory device 500 operates in an auto-refresh or self-refresh mode of operation in response to an auto- or self-refresh command being applied to the memory device 500, as will be appreciated by those skilled in the art.

A column address is applied on the ADDR bus after the row and bank addresses, and the address register 502 applies the column address to a column address counter and latch 514 which, in turn, latches the column address and applies the latched column address to a plurality of column decoders 516A-D. The bank control logic 506 activates the column decoder 516A-D corresponding to the received bank address, and the activated column decoder decodes the applied column address. Depending on the operating mode of the memory device 500, the column address counter and latch 514 either directly applies the latched column address to the decoders 516A-D, or applies a sequence of column addresses to the decoders starting at the column address provided by the address register 502. In response to the column address from the counter and latch 514, the activated column decoder 516A-D applies decode and control signals to an I/O gating and data masking circuit 518 which, in turn, accesses memory cells corresponding to the decoded column address in the activated row of memory cells in the memory bank 512A-D being accessed.

During data read operations, data being read from the addressed memory cells is coupled through the I/O gating and data masking circuit 518 to a read latch 520. The I/O gating and data masking circuit 518 supplies N bits of data to the read latch 520, which then applies two N/2 bit words to a multiplexer 522. The circuit 518 provides 64 bits to the read latch 520 which, in turn, provides two 32 bits words to the multiplexer 522. A data driver 524 sequentially receives the N/2 bit words from the multiplexer 522 and also receives a data signal DQS from the clock generator 523. A read latency control circuit 526 is additional coupled to the read latch 520, and receives internal clock signals CLKUP and CLKDN from the clock generator 523. The DQS signal is used by an external circuit such as a memory controller (not shown) in latching data from the memory device 500 during read operations. As previously described, the read latency control circuit 526 controls the read latch 520 to synchronize the read data output relative to the DQS signal such that a specified read latency is met. In response to the DQS signal and a LATENCY control signal provided by the read latency control circuit 526, the data driver 524 sequentially outputs the received N/2 bits words as a corresponding data word DQ, each data word being output in synchronism with rising or falling edges of the CLK signal that is applied to clock the memory device 500. The data driver 524 also outputs the data clock signal DQS in synchronism with rising or falling edges of the CLK signal. Each data word DQ and the data clock signal DQS collectively define a data bus DATA. As will be appreciated by those skilled in the art, the data clock signal DQS is a delayed version of the CLK signal, and the clock generator 523 adjusts the DQS signal relative to the CLK signal to ensure that the DQS signal and the DQ words are placed on the DATA bus in synchronism with the CLK signal, as previously described. The DATA bus also includes masking signals DM0-M, which will be described in more detail below with reference to data write operations.

During data write operations, an external circuit such as a memory controller (not shown) applies N/2 bit data words DQ, the DQS signal, and corresponding data masking signals DM on the data bus DATA. A data receiver 528 receives each DQ word and the associated DM signals, and applies these signals to input registers 530 that are clocked by the DQS signal. In response to a rising edge of the DQS signal, the input registers 530 latch a first N/2 bit DQ word and the associated DM signals, and in response to a falling edge of the DQS signal the input registers latch the second N/2 bit DQ word and associated DM signals. The input register 530 provides the two latched N/2 bit DQ words as an N-bit word to a write FIFO and driver 532, which clocks the applied DQ word and DM signals into the write FIFO and driver in response to the DQS signal. The DQ word is clocked out of the write FIFO and driver 532 in response to the CLK signal, and is applied to the I/O gating and masking circuit 518. The I/O gating and masking circuit 518 transfers the DQ word to the addressed memory cells in the accessed bank 512A-D subject to the DM signals, which may be used to selectively mask bits or groups of bits in the DQ words (i.e., in the write data) being written to the addressed memory cells.

A control logic and command decoder 534 receives a plurality of command and clocking signals over a control bus CONT, typically from an external circuit such as a memory controller (not shown). The command signals include a chip select signal CS*, a write enable signal WE*, a column address strobe signal CAS*, and a row address strobe signal RAS*, while the clocking signals include a clock enable signal CKE* and complementary clock signals CLK, CLK*, with the “*” designating a signal as being active low. The command signals CS*, WE*, CAS*, and RAS* are driven to values corresponding to a particular command, such as a read, write, or auto-refresh command. In response to the clock signal CLK or the combination of clock signals CLK, CLK*, the command decoder 534 latches and decodes an applied command, and generates a sequence of clocking and control signals that control the components 502-532 to execute the function of the applied command. The clock enable signal CKE enables clocking of the command decoder 534 by the clock signal CLK or clock signals CLK, CLK*. The command decoder 534 latches command and address signals at edges of the CLK signal or CLK and CLK* signals (i.e., the crossing point of CLK going high and CLK* going low), while the input registers 530 and data drivers 524 transfer data into and from, respectively, the memory device 500 in response to the edges of the data strobe signal DQS and thus at the frequency of the clock signal CLK or clock signals CLK, CLK*. This is true because the DQS signal has the same frequency as the clock signals. The memory device 500 is referred to as a double-data-rate device because the data words DQ being transferred to and from the device are transferred at double the rate of a conventional SDRAM, which transfers data at a rate corresponding to the frequency of the applied clock signal. The detailed operation of the control logic and command decoder 534 in generating the control and timing signals is conventional, and thus, for the sake of brevity, will not be described in more detail.

FIG. 6 is a block diagram of an embodiment of a processor-based system 600 including processor circuitry 602, which includes the memory device 500 of FIG. 5 or a memory device according to some other embodiment of the invention. Conventionally, the processor circuitry 602 is coupled through address, data, and control buses to the memory device 500 to provide for writing data to and reading data from the memory device 500. The processor circuitry 602 includes circuitry for performing various processing functions, such as executing specific software to perform specific calculations or tasks. In addition, the processor-based system 600 includes one or more input devices 604, such as a keyboard or a mouse, coupled to the processor circuitry 602 to allow an operator to interface with the processor-based system 600. Typically, the processor-based system 600 also includes one or more output devices 606 coupled to the processor circuitry 602, such as output devices typically including a printer and a video terminal. One or more data storage devices 608 are also typically coupled to the processor circuitry 602 to store data or retrieve data from external storage media (not shown). Examples of typical storage devices 608 include hard and floppy disks, tape cassettes, compact disk read-only (“CD-ROMs”) and compact disk read-write (“CD-RW”) memories, and digital video disks (“DVDs”).

From the foregoing it will be appreciated that, although specific embodiments of the invention have been described herein for purposes of illustration, various modifications may be made without deviating from the spirit and scope of the embodiments of the invention. For example, many of the components described above may be implemented using either digital or analog circuitry, or a combination of both. 

What is claimed is:
 1. A latency control apparatus, comprising: a synchronization circuit comprising: a delay line configured to receive an input clock signal and delay the input clock signal by a delay amount to provide a delayed input clock signal; and a feedback circuit coupled to the delay line and configured to receive the input clock signal and the delayed input clock signal, the feedback circuit further configured to provide a count signal responsive to achieving a locking, condition, the count signal indicative of a number of clock cycles used to achieve the locking condition between the input clock signal and the delayed input clock signal; and a read latency control circuit configured to delay a read signal based, at least in part, on the delay amount.
 2. The latency control apparatus of claim 1, wherein the feedback circuit comprises a phase detector configured to adjust a timing of the input clock signal and provide the count signal.
 3. The latency control apparatus of claim 2, wherein the feedback circuit is further configured to delay the delayed input clock signal by an integer number of clock cycles responsive, at least in part, to adjusting the timing of the input clock signal.
 4. The latency control apparatus of claim 1, wherein the synchronization circuit further comprises: an initialization circuit configured to prevent the delay line from toggling for a predetermined number of clock cycles responsive, at least in pan, to receipt of a signal from the feedback circuit.
 5. The latency control apparatus of claim 1, wherein the read latency control circuit is further configured to delay the read signal based, at least in part, on a match signal.
 6. The latency control apparatus of claim 5, further comprising: a latch configured to capture the read signal based, at least in part, on the input clock signal.
 7. The latency control apparatus of claim 1, wherein the read latency control circuit is further configured to delay the read signal based, at least in part, on a latency signal received from a mode register.
 8. A method of providing read data with a latency, comprising: receiving an input clock signal; delaying the input clock signal by a first amount to provide an internal clock signal; providing a count signal responsive to achieving a phase-locked condition, the count signal based, at least in part, on the first amount and indicative of a number of clock cycles used to delay the input clock signal to achieve the phase-locked condition; and delaying a read signal by a second amount based, at least in part, on the count signal.
 9. The method of claim 8, further comprising: providing the read signal to an output driver based, at least in part, on the internal clock signal.
 10. The method of claim 8, wherein the second amount is based, at least in part, on a match signal.
 11. The method of claim 8, wherein said delaying the read signal comprises: latching the read signal based, at least in part, on the input clock signal; and providing a match signal based at least in part, on the internal clock signal.
 12. The method of claim 8, wherein the first amount is based, at least in part, on a delay of a delay line.
 13. The method of claim 8, further comprising: before said providing a count signal, locking a delay line for a predetermined number of clock cycles.
 14. The method of claim 8, wherein the count signal represents a number of dock cycles. 